Abstract. In 1986, Osherson, Stob and Weinstein asked whether two variants of
anomalous vacillatory learning, $\mathrm{TxtFex}^*_*$ and $\mathrm{TxtFext}^*_*$,
could be distinguished [3]. These learning criteria place bounds neither on the
number of hypotheses between which the learner is allowed to vacillate nor on the
number of errors permitted, requiring merely that both be finite. The criteria
differ in that the more restrictive one, $\mathrm{TxtFext}^*_*$-learning, requires
that all hypotheses output infinitely often describe the same finite variant of
the correct set, while $\mathrm{TxtFex}^*_*$ permits the learner to vacillate
between finitely many different finite variants of the correct set. In this paper
we show that $\mathrm{TxtFex}^*_* \neq \mathrm{TxtFext}^*_*$, thereby answering the
question posed by Osherson et al. We prove this in a strong way by exhibiting a
family in $\mathrm{TxtFex}^*_2 \setminus \mathrm{TxtFext}^*_*$.



1. Introduction 

In order to prove $\mathrm{TxtFex}^*_* \neq \mathrm{TxtFext}^*_*$, we explicitly
construct a family that is $\mathrm{TxtFex}^*_*$-learnable, but not
$\mathrm{TxtFext}^*_*$-learnable. We diagonalize against every attempt to
$\mathrm{TxtFext}^*_*$-learn the whole family by including, for each machine, a
subfamily that witnesses the machine's failure to $\mathrm{TxtFext}^*_*$-learn.
Each subfamily is produced by means of an effective construction and the whole
family is uniformly computably enumerable (u.c.e.).

All sets considered are subsets of the natural numbers and all families are
collections of such subsets. We will use $\langle x, y \rangle$ to denote a
computable pairing function. We will regard a set as coding a countable family,
whereby each member of the family is coded as a column of the set. In other words,
if $A$ is to code a family of sets, then the $i$th set of the family is
$\{x : \langle i, x \rangle \in A\}$. Given natural numbers $e$ and $s$, $W_{e,s}$
will denote the result of computing the set coded by $e$, up to $s$ stages, using a
fixed computable numbering of c.e. sets. Let $D_n$ denote a fixed computable
enumeration of finite sets. Lowercase Greek letters will typically refer to strings
of natural numbers. Enumerations (called texts in learning theory) will be treated
either as infinite strings or as functions on the natural numbers, depending on
which is most appropriate in the given setting. Initial segments of enumerations
will feature throughout this paper and will either be denoted by lowercase Greek
letters, as mentioned above, or by initial segments of functions (e.g. $T[n]$ in
learning theory notation, or $f \upharpoonright n$ in standard logic notation).
Elements of an enumeration will be denoted by $T(n)$ or $f(n)$. To switch from
finite ordered lists to unordered sets, we say that
$\mathrm{content}(\sigma) = \{x \in \mathbb{N} : \exists n\,(x = \sigma(n))\}$. For
enumerations, we extend the content notation to denote the set that is enumerated.
We indicate that the symmetric difference of $A$ and $B$ has cardinality at most
$a$ by $A =^a B$. We extend this notation to include the case where the symmetric
difference is merely finite by replacing $a$ with $*$.
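
The coding conventions above can be tried out on finite data. The following is a
minimal sketch, assuming the Cantor pairing function as the computable pairing (the
text does not fix a particular one) and reading $A =^a B$ as "at most $a$
differences". The names `pair`, `column`, `content` and `eq_up_to` are hypothetical
helpers introduced only for illustration.

```python
from typing import Iterable, Optional, Set


def pair(x: int, y: int) -> int:
    """Cantor pairing <x, y>: one possible computable pairing function (an assumption)."""
    return (x + y) * (x + y + 1) // 2 + y


def column(A: Set[int], i: int, bound: int) -> Set[int]:
    """The i-th set coded by A, restricted to elements below `bound`."""
    return {x for x in range(bound) if pair(i, x) in A}


def content(sigma: Iterable[int]) -> Set[int]:
    """Forget the order of a finite string: the set of its entries."""
    return set(sigma)


def eq_up_to(A: Set[int], B: Set[int], a: Optional[int]) -> bool:
    """A =^a B for finite sets; a=None plays the role of * (any finite difference)."""
    return a is None or len(A ^ B) <= a
```

For instance, the set `{pair(0, 1), pair(0, 3), pair(1, 2)}` codes a family whose
column 0 is `{1, 3}` and whose column 1 is `{2}`. On finite sets the `*` case is
trivially true; it only becomes a real restriction for infinite sets.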

We will consider a fixed computable enumeration of all effective learning
machines, functions from finite strings of natural numbers to $\mathbb{N}$, where
$M_e$ denotes the learner coded by $e$. In general, learners will be denoted by $M$.

Definition 1.1 ([1]). Let $i, j \in \mathbb{N} \cup \{*\}$. The definition of
$\mathrm{TxtFex}^i_j$-learning is in four parts.

(1) $M$ $\mathrm{TxtFex}^i_j$-identifies a text $T$ if and only if there is a set
$S$ with $|S| \le j$ (or merely finite $S$ if $j = *$) such that
$(\forall^\infty n)(\forall a \in S)(M(T[n]) \in S \wedge W_a =^i \mathrm{content}(T))$.

(2) $M$ $\mathrm{TxtFex}^i_j$-identifies a c.e. set $A$ if and only if $M$
$\mathrm{TxtFex}^i_j$-identifies every text for $A$.

(3) $M$ $\mathrm{TxtFex}^i_j$-learns a family of c.e. sets if and only if it
$\mathrm{TxtFex}^i_j$-identifies every member of the family.

(4) $\mathcal{F}$ is $\mathrm{TxtFex}^i_j$-learnable (denoted
$\mathcal{F} \in \mathrm{TxtFex}^i_j$) if and only if there is a machine $M$ that
$\mathrm{TxtFex}^i_j$-learns $\mathcal{F}$.

Definition 1.2 (0). Let $i, j \in \mathbb{N} \cup \{*\}$. The definition of
$\mathrm{TxtFext}^i_j$-learning is analogous to that of $\mathrm{TxtFex}^i_j$.

(1) $M$ $\mathrm{TxtFext}^i_j$-identifies a text $T$ if and only if there is a set
$S$ with $|S| \le j$ (or merely finite if $j = *$) such that
$(\forall^\infty n)(\forall a, b \in S)(M(T[n]) \in S \wedge W_a = W_b =^i \mathrm{content}(T))$.

(2) $M$ $\mathrm{TxtFext}^i_j$-identifies a c.e. set $A$ if and only if $M$
$\mathrm{TxtFext}^i_j$-identifies every text for $A$.

(3) $M$ $\mathrm{TxtFext}^i_j$-learns a family of c.e. sets if and only if it
$\mathrm{TxtFext}^i_j$-identifies every member of the family.

(4) $\mathcal{F}$ is $\mathrm{TxtFext}^i_j$-learnable (denoted
$\mathcal{F} \in \mathrm{TxtFext}^i_j$) if and only if there is a machine $M$ that
$\mathrm{TxtFext}^i_j$-learns $\mathcal{F}$.
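
The limit clauses in the two definitions cannot be checked on finite data, but the
difference between them can be illustrated in a toy setting. The sketch below
assumes that each index codes an explicitly given finite set (the dictionary `W`),
that `tail` is the sequence of hypotheses output from some point on, and that $S$
is taken to be exactly the set of hypotheses appearing in that tail. The anomaly
relation is read as "at most $i$ differences". All names are hypothetical.

```python
from itertools import combinations
from typing import Dict, Optional, Sequence, Set


def close(A: Set[int], B: Set[int], i: Optional[int]) -> bool:
    """A =^i B for finite sets; i=None stands for * (any finite difference)."""
    return i is None or len(A ^ B) <= i


def fex_ok(tail: Sequence[int], W: Dict[int, Set[int]], target: Set[int],
           i: Optional[int], j: Optional[int]) -> bool:
    """Toy version of clause (1) of the TxtFex definition on a hypothesis tail."""
    S = set(tail)
    if j is not None and len(S) > j:
        return False  # more than j hypotheses appear in the tail
    return all(close(W[a], target, i) for a in S)


def fext_ok(tail: Sequence[int], W: Dict[int, Set[int]], target: Set[int],
            i: Optional[int], j: Optional[int]) -> bool:
    """Toy version of the TxtFext clause: additionally, all hypotheses code one set."""
    S = set(tail)
    same_set = all(W[a] == W[b] for a, b in combinations(S, 2))
    return fex_ok(tail, W, target, i, j) and same_set
```

With `W = {0: {1, 2, 3}, 1: {1, 2, 3, 4}, 2: {1, 2, 3}}` and target `{1, 2, 3}`,
the tail `[0, 1, 0, 1]` passes `fex_ok` with `i = 1`, `j = 2` but fails `fext_ok`,
since indices 0 and 1 code different sets; the tail `[0, 2, 0, 2]` passes both.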

This paper is concerned with $\mathrm{TxtFex}^*_*$ and $\mathrm{TxtFext}^*_*$.
Inspection of the definitions reveals that $\mathrm{TxtFext}^*_*$ is at least
nominally more restrictive than $\mathrm{TxtFex}^*_*$ and hence a weaker learning
criterion in the sense that every $\mathrm{TxtFext}^*_*$-learnable family is also
$\mathrm{TxtFex}^*_*$-learnable, i.e.
$\mathrm{TxtFext}^*_* \subseteq \mathrm{TxtFex}^*_*$. The following two theorems
tantalizingly hinted that $\mathrm{TxtFext}^*_*$ might be equivalent to
$\mathrm{TxtFex}^*_*$. As we shall see, this is not the case.

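Theorem 1.3 (Fulk, Jain, Osherson).
$(\forall i, j \in \mathbb{N})(\mathrm{TxtFex}^i_j \subseteq \mathrm{TxtFext}^c_j)$,
where $c$ depends only on $i$.

Theorem 1.4 (Fulk, Jain, Osherson).
$(\forall j \in \mathbb{N})(\mathrm{TxtFex}^j_* \subseteq \mathrm{TxtFext}^*_*)$.

For proofs of these theorems, see [2].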

2. $\mathrm{TxtFex}^*_* \neq \mathrm{TxtFext}^*_*$

We will construct the family witnessing the main result by diagonalizing against
every machine's attempt to $\mathrm{TxtFext}^*_*$-learn. Each step of the
diagonalization will produce a family, henceforth denoted $\mathcal{L}_e$, which
$M_e$ cannot $\mathrm{TxtFext}^*_*$-learn.

The intuitive concept behind the diagonalization is to search for a string,
$\sigma$, on which the learner commits to hypothesizing a finite number of
different codes for the same set on all extensions. Such a string may not exist,
but the construction will be such that if no string can be found, the family under
construction will be certain to include a set, on some enumeration of which, the
machine never commits to output only hypotheses that code a single fixed set. If
such a string does exist, we will arrange for the existence of two sets in the
family that contain $\mathrm{content}(\sigma)$ and have infinite symmetric
difference. Otherwise, the learner against which we are diagonalizing has already
failed to $\mathrm{TxtFext}^*_*$-learn.

Theorem 2.1. There is a u.c.e. family, $\mathcal{L}$, that is
$\mathrm{TxtFex}^*_2$-learnable, but is not $\mathrm{TxtFext}^*_*$-learnable.

Proof. Fix a learner, $M_e$. We begin by describing what is needed to prevent $M_e$
from $\mathrm{TxtFext}^*_*$-learning. The result of this step will be a family
$\mathcal{L}_e$. For notational convenience, let $L_e$ denote
$\{e, e + 1, \ldots\}$, which may be a member of $\mathcal{L}_e$ depending on the
course of the construction. Motivated by our interest in strings on which $M_e$ has
committed to a finite collection of hypotheses, we make the following definition.

Definition 2.2. A string $\sigma$ is said to be an $(e, k)$-stabilizing sequence if
and only if the following conditions are met for all $\tau \succeq \sigma$ and
$t \in \mathbb{N}$:

(1) $\{e, e + 1, \ldots, e + k\} \subseteq \mathrm{content}(\sigma) \subseteq \mathrm{content}(\tau) \subseteq L_e$

(2) $M_e(\tau) < |\sigma|$

(3) $W_{M_e(\sigma),|\sigma|+t} \cap [0, k) = W_{M_e(\tau),|\sigma|+t} \cap [0, k)$.

In essence, Definition 2.2 describes strings that adhere to a certain form (namely,
that they are contained in and contain a suitable initial segment of $L_e$), that
define cones in $\omega^{<\omega}$ on which $M_e$ outputs no new hypotheses, and on
whose extensions the hypotheses $M_e$ outputs are equal. Since this last claim
cannot be verified in the limit (it is a $\Pi^0_2$ predicate), the above definition
describes a finite approximation.
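
To make the finite approximation concrete, the following sketch checks whether a
given pair $(\tau, t)$ refutes the claim that $\sigma$ is $(e, k)$-stabilizing. It
assumes one reading of Definition 2.2, in which $\tau$ ranges over extensions of
$\sigma$ whose content lies in $L_e$. The learner `M` and the stage approximation
`W` are passed in as ordinary Python functions; these, and the names `has_form` and
`refutes`, are hypothetical stand-ins rather than part of the construction.

```python
from typing import Callable, Sequence, Set, Tuple

Learner = Callable[[Tuple[int, ...]], int]   # finite string -> hypothesis
Approx = Callable[[int, int], Set[int]]      # (index, stage) -> finite set W_{index,stage}


def has_form(sigma: Sequence[int], e: int, k: int) -> bool:
    """Condition (1) for sigma: {e, ..., e+k} <= content(sigma) <= L_e."""
    c = set(sigma)
    return set(range(e, e + k + 1)) <= c and all(x >= e for x in c)


def refutes(M: Learner, W: Approx, e: int, k: int,
            sigma: Sequence[int], tau: Sequence[int], t: int) -> bool:
    """True if (tau, t) witnesses that sigma violates condition (2) or (3)."""
    sigma, tau = tuple(sigma), tuple(tau)
    if tau[:len(sigma)] != sigma or any(x < e for x in tau):
        return False  # tau is not an extension of sigma with content in L_e
    stage = len(sigma) + t
    below_k = set(range(k))
    new_hypothesis = M(tau) >= len(sigma)                      # condition (2) fails
    disagreement = (W(M(sigma), stage) & below_k) != (W(M(tau), stage) & below_k)
    return new_hypothesis or disagreement                      # condition (3) fails
```

Searching over all pairs $(\tau, t)$ with such a check is an unbounded existential
search, so a refutation is found after finitely many steps whenever one exists.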

The predicate "$\sigma$ is not an $(e, k)$-stabilizing sequence" is $\Sigma^0_1$,
as it requires only a witnessing string and natural number to verify. Thus we may
define a sequence of strings that converges in the limit to an $(e, 0)$-stabilizing
sequence, $\sigma_{e,0}$, if such a string exists. Extending such a strategy, we
may define $\sigma_{e,k,s}$ for all $k, s \in \mathbb{N}$, such that

• $\sigma_{e,k,s} \preceq \sigma_{e,k+1,s}$ for all $k, s \in \mathbb{N}$

• $\sigma_{e,0,0}, \sigma_{e,0,1}, \ldots$ converges to an $(e, 0)$-stabilizing
sequence, if one exists

• If $\sigma_{e,k,0}, \sigma_{e,k,1}, \ldots$ converges to a string $\sigma_{e,k}$
for all $k < n$, then $\sigma_{e,n,0}, \sigma_{e,n,1}, \ldots$ converges to an
$(e, n)$-stabilizing sequence, $\sigma_{e,n}$, that extends $\sigma_{e,k}$ for all
$k < n$, if such a $\sigma_{e,n}$ exists.

If $k$ is least such that $\sigma_{e,k,0}, \sigma_{e,k,1}, \ldots$ does not
converge, then $\sigma_{e,m,0}, \sigma_{e,m,1}, \ldots$ will never converge for
$m > k$ and there are no $(e, m)$-stabilizing sequences that extend
$\sigma_{e,k-1}$. Define $a_{e,k}$ to be the least even number greater than
$e + k + 1$ such that $\sigma_{e,h,s} = \sigma_{e,h,s+1}$ for all $h \le k$ and
$s \ge a_{e,k}$. Let $b_{e,k} = a_{e,k} + 1$. These numbers will allow us to
monitor the convergence of the $\sigma_{e,k,s}$ and control the construction as
appropriate.

Define two sets

$$R_e = \{x \in L_e : \forall k\,(x \neq a_{e,k})\}$$
$$R_e^* = \{x \in L_e : \forall k\,(x \neq b_{e,k})\}.$$
Observe that $R_e$ is c.e. Because $a_{e,0} \le a_{e,1} \le \cdots$, to decide
whether or not $x \in R_e$, one need only determine whether or not
$x \neq a_{e,k}$ for all $k$ such that $a_{e,k} \le x$. Similarly, substituting
$b_{e,k}$ for $a_{e,k}$, we see that $R_e^*$ is also c.e. We are now in a position
to define $\mathcal{L}_e$.

$$\mathcal{L}_e = \{R_e \cup (D_n \cap [e, \infty)) : n \in \mathbb{N}\} \cup \{R_e^* \cup (D_n \cap [e, \infty)) : n \in \mathbb{N}\}$$
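
As a finite illustration of these definitions, the sketch below builds truncated
versions of $R_e$, $R_e^*$ and a member of $\mathcal{L}_e$ from a hand-picked list
of values standing in for the $a_{e,k}$. In the construction these values are
determined by the convergence of the strings $\sigma_{e,k,s}$; here they are simply
assumed, and the function names and the cutoff `bound` are hypothetical.

```python
from typing import Iterable, Set, Tuple


def toy_R_sets(e: int, a_values: Iterable[int], bound: int) -> Tuple[Set[int], Set[int]]:
    """Truncations of R_e and R_e^* to [e, bound), given assumed values a_{e,k}."""
    L_e = set(range(e, bound))
    a = set(a_values)                 # even numbers removed from R_e
    b = {x + 1 for x in a}            # b_{e,k} = a_{e,k} + 1, removed from R_e^*
    return L_e - a, L_e - b


def family_member(R: Set[int], D: Set[int], e: int) -> Set[int]:
    """R united with the part of the finite set D that lies in [e, infinity)."""
    return R | {x for x in D if x >= e}
```

With `e = 3`, assumed values `[6, 8]` and `bound = 12`, the truncated $R_e$ omits 6
and 8 while the truncated $R_e^*$ omits 7 and 9, so the two sets differ on exactly
those four points.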

Recall that $D_0, D_1, \ldots$ enumerates all finite sets. We now return to $M_e$,
the learner against which we are currently diagonalizing. We must prove that $M_e$
is incapable of $\mathrm{TxtFext}^*_*$-learning $\mathcal{L}_e$. Two cases
encompass the possible ways in which $M_e$ will fail to learn $\mathcal{L}_e$.

Case 1: Suppose that there is a minimal $k$ such that $\sigma_{e,k}$ is undefined.
By definition, this implies there is no $\sigma$ extending $\sigma_{e,k-1}$ such
that $e + k \in \mathrm{content}(\sigma) \subseteq L_e$ and
$W_{M_e(\sigma),|\sigma|+t} \cap [0, k) = W_{M_e(\tau),|\sigma|+t} \cap [0, k)$ for
all $t \in \mathbb{N}$ and all $\tau$ such that $\sigma \preceq \tau$ with
$\mathrm{content}(\tau) \subseteq L_e$. Furthermore, since $\sigma_{e,k}$ is
undefined, $\sigma_{e,i}$ is undefined for all $i > k$. Consequently, $a_{e,i}$ is
undefined for all $i \ge k$ and both $R_e$ and $R_e^*$ are cofinite subsets of
$L_e$. For a suitable finite set, $D_n$, we have
$R_e \cup (D_n \cap [e, \infty)) = L_e$ and hence, $L_e \in \mathcal{L}_e$. By
repeatedly selecting extensions on which $M_e$ outputs hypotheses coding distinct
sets, we can inductively build an enumeration of $L_e$ on which $M_e$ infinitely
often outputs codes for at least two sets that are not equal. If $k = 0$, there is
the additional possibility that $M_e$ cannot be made to select a finite collection
of hypotheses and restrict its output to that finite list. The machine has failed
to $\mathrm{TxtFext}^*_*$-learn $L_e$ in either case.

Case 2: Suppose that $\sigma_{e,k}$ is defined for all $k$. Both $R_e$ and $R_e^*$
are coinfinite sets and have infinite symmetric difference. By the definition of
$\sigma_{e,0}$, for any $\tau$ such that $\sigma_{e,0} \prec \tau$ and
$\mathrm{content}(\tau) \subseteq L_e$, $M_e(\tau) < |\sigma_{e,0}|$. We may
therefore define a finite list, $h_0, h_1, \ldots, h_n$, of all distinct hypotheses
that $M_e$ outputs on extensions of $\sigma_{e,0}$. Pick $k$ sufficiently large
that, for all $i, j \le n$ such that $W_{h_i} \neq W_{h_j}$, there is an $x < k$
with $x \in W_{h_i} \triangle W_{h_j}$.

All hypotheses made by $M_e$ on extensions of $\sigma_{e,k}$ contained in $L_e$
must have the same intersection with $[0, k-1]$ as $W_{M_e(\sigma_{e,k})}$. By the
choice of $k$, all such hypotheses must code the same set, yet $\mathcal{L}_e$
contains two sets that extend $\sigma_{e,k}$ and have infinite symmetric
difference: $\mathrm{content}(\sigma_{e,k}) \cup R_e$ and
$\mathrm{content}(\sigma_{e,k}) \cup R_e^*$. Again, we witness failure by $M_e$ to
$\mathrm{TxtFext}^*_*$-learn $\mathcal{L}_e$.

Above we have seen that each step of the diagonalization succeeds in defeating an
attempt at $\mathrm{TxtFext}^*_*$-learning. Consequently,
$\mathcal{L} = \bigcup_{e \in \mathbb{N}} \mathcal{L}_e$ is not
$\mathrm{TxtFext}^*_*$-learnable. We must now verify that $\mathcal{L}$ is indeed
$\mathrm{TxtFex}^*_2$-learnable.

Every set in $\mathcal{L}$ is a finite variant of $R_e$ or $R_e^*$ for some
$e \in \mathbb{N}$. Therefore, a learner need only identify the appropriate $e$ and
to which of $R_e$ and $R_e^*$ the input enumeration is most similar. Since $R_e$ is
co-even and $R_e^*$ is co-odd, they are identifiable by the numbers not in them.
Let $x_e$ and $x_e^*$ be codes for $R_e$ and $R_e^*$, respectively. For notational
ease, let $m_\sigma = \min(\mathrm{content}(\sigma))$ and
$n_\sigma = \min(\{y > m_\sigma : y \notin \mathrm{content}(\sigma)\})$. Given
$\sigma$, an initial segment of an enumeration for a set in $\mathcal{L}$,
$m_\sigma$ is the current guess at the least member of the set (hence the $e$ for
which the set is in $\mathcal{L}_e$) and $n_\sigma$ is the current guess at the
least element of $L_e$ not in the set. Define a machine $M$ as follows:
$$M(\sigma) = \begin{cases}
0 & \mathrm{content}(\sigma) = \emptyset \\
x_e & e = m_\sigma \wedge (n_\sigma \text{ is even}) \\
x_e^* & e = m_\sigma \wedge (n_\sigma \text{ is odd})
\end{cases}$$
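
The learner just defined is simple enough to sketch directly. In the version below
the codes $x_e$ and $x_e^*$ are represented by the tags `("R", e)` and `("R*", e)`
rather than by indices in a numbering of c.e. sets; this representation and the
name `learner` are assumptions made only for illustration.

```python
from typing import Sequence, Tuple, Union

Hypothesis = Union[int, Tuple[str, int]]


def learner(sigma: Sequence[int]) -> Hypothesis:
    """Toy version of M: guess e from the minimum, the variant from the first gap."""
    c = set(sigma)
    if not c:
        return 0                      # no data yet: output a default hypothesis
    m = min(c)                        # m_sigma: current guess at e
    n = m + 1
    while n in c:                     # n_sigma: least y > m_sigma missing so far
        n += 1
    return ("R", m) if n % 2 == 0 else ("R*", m)
```

On prefixes of the string `3, 4, 5, 7, 9` the sketch outputs `("R", 3)`,
`("R*", 3)`, and then `("R", 3)` from the third element onward, because the least
missing number above 3 settles on the even number 6.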

Suppose that $M$ is receiving an enumeration for $L \in \mathcal{L}$. Every set in
$\mathcal{L}$ is either of the form $R_e \cup D_n$ or $R_e^* \cup D_n$. By the
symmetric relationship between $R_e$ and $R_e^*$, we may assume that
$L = R_e \cup D_n$ for some specific $e$ and $n$. We must consider two cases: $R_e$
is either cofinite or coinfinite.

If $R_e$ is cofinite, $R_e^*$ is also cofinite. As a consequence, $R_e =^* R_e^*$.
Eventually, the enumeration will exhibit $e$, the least element of the set being
enumerated. After that stage, $M$ will output either $x_e$ or $x_e^*$. Given the
model of learning, both are correct hypotheses. If $R_e$ is coinfinite, then
$(\mathbb{N} \setminus R_e) \cap [e, \infty)$ is an unbounded set of even numbers.
The target set is a finite variant of $R_e$. Hence, the least element not in
$\mathrm{content}(\sigma)$ and greater than $e$ will be even for cofinitely many
initial segments of any enumeration. In other words, for cofinitely many initial
segments, $\sigma$, of any enumeration of $L$, $n_\sigma$ is even and
$M(\sigma) = x_e$, a code for a finite variant of the enumerated set.

We have constructed a family $\mathcal{L}$ such that for each computable machine,
$\mathcal{L}$ contains a subfamily that the machine cannot
$\mathrm{TxtFext}^*_*$-learn, and we have exhibited a specific machine that
$\mathrm{TxtFex}^*_2$-learns the whole family. This completes the proof.

□ 

3. Conclusion 

As mentioned in the introduction, Fulk et al. [2] proved several results about the
relative strength of TxtFex and TxtFext learning, culminating in the conclusion
that

$$(\forall j)(\mathrm{TxtFex}^j_* \subseteq \mathrm{TxtFext}^*_*).$$

Combining this with our result, we observe the following intriguing relationship
between the anomalous versions of the two learning criteria:

$$(\forall j)(\mathrm{TxtFex}^j_* \subseteq \mathrm{TxtFext}^*_* \subsetneq \mathrm{TxtFex}^*_*).$$

A great number of other results about vacillatory learning are already known.
Many of the results can be found in a paper of Case's [1] or in Osherson, Stob and
Weinstein's book [3].